Salesforce has released an open source multimodal AI model named xGen-MM, designed to simultaneously understand and generate various data types such as text and images, significantly changing the way AI research and applications are conducted. The model performs exceptionally well in multiple benchmark tests, demonstrating strong performance compared to similar open source models, and includes pretrained models, datasets, and fine-tuning code. The largest model boasts 4 billion parameters and can handle 'interleaved data,' enabling multitasking, such as answering questions about multiple images at once. The diverse options available in the model reflect the potential of AI.